Improvement of CRF-Based Accent Sandhi Prediction Using The Features Derived from Accent Rules
نویسندگان
چکیده
When developing Japanese text-to-speech (TTS) systems, algorithms to accurately predict accent types of each constituent phrase is essential for better output speech quality. In our previous studies on the accent type estimation, a CRF-based method was realized. Although this method outperformed the conventional rule-based method, the estimation accuracy of particular phrases such as those including numerals or loanwords was still not sufficient. In this paper, we newly added the features used in the rule-based estimation of these phrases as CRF features. The experimental result for JNAS corpus showed improvements in accent type estimation. As an example of possible applications of the developed method other than speech synthesis, we constructed an accent type prediction module for CALL systems. This module can automatically generate accent dictionaries of conjugation words for any Japanese texts.
منابع مشابه
Improved Prediction of Japanese Word Accent Sandhi Using CRF
In Japanese, every content word has its own mora-based H/L pitch pattern when it is uttered in isolation, called accent type. When reading out a written sentence, however, this lexical H/L pattern is often changed according to the context, known as word accent sandhi. In our previous work, an accent sandhi predictor was developed using CRF [1], and in this paper, the predictor is improved throu...
متن کاملCRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems
In Japanese, every content word has its own H/L pitch pattern when it is uttered isolatedly, called accent type. In a TTS system, this lexical information is usually stored in a dictionary and it is referred to for prosody generation. When converting a written sentence to speech, however, this lexical H/L pattern is often changed according to the context, known as word accent sandhi. This accen...
متن کاملAutomatic prosodic labeling of accent information for Japanese spoken sentences
This paper describes a method of automatic labeling of prosodic information focusing on accent types and accent phrase boundaries for Japanese spoken sentences. They are predicted by CRF (Conditional Random Fields) using linguistic information and F0 contour information. In the prediction of the accent type, we propose a method that uses a provisional accent type predicted by linguistic informa...
متن کاملAccent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields
When synthesizing speech from Japanese text, correct assignment of accent nuclei for input text with arbitrary contents is indispensable in obtaining naturally-sounding synthetic speech. A phenomenon called accent sandhi occurs in utterances of Japanese; when a word is uttered in a sentence, its accent nucleus may change depending on the contexts of preceding/succeeding words. This paper descri...
متن کاملDevelopment and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody
This paper develops an online and freely available framework to aid teaching and learning the prosodic control of Tokyo Japanese: how to generate its adequate word accent and phrase intonation. This framework is called OJAD (Online Japanese Accent Dictionary) [1] and it provides three features. 1) Visual, auditory, systematic, and comprehensive illustration of patterns of accent change (accent ...
متن کامل